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ALTERATION OF ACYLTRANSFERASE DOMAIN SUBSTRATE SPECIFICITY 

CROSS-REFERENCE TO RELATED APPLICATION 

5 [0001 ] The present application claims the benefit of the filing date of 

provisional U.S. patent application Serial No. 60/310,730, filed August 7, 2001, 
incorporated herein by reference. 

REFERENCE TO GOVERNMENT FUNDING 

10 10002] This invention was supported in part by SBIR grant DK57380. The 

U.S. government has certain rights in this invention. 

FIELD OF THE INVENTION 

[0003] The present invention provides methods for altering the subunit 
1 5 specificity of acyltransferase domains of polyketide synthase (PKS) enzymes, as well as 
PKS altered by the method, recombinant vectors encoding them, host cells expressing 
them, and the polyketides so produced. The present invention relates to the fields of 
molecular biology, biochemistry, medicinal chemistry, and medicine. 

BACKGROUND OF THE INVENTION 

20 [0004] Polyketides are a structurally diverse class of natural products that 

have played an important role in the development of therapeutic and agricultural 
chemicals (7, 2 — all references cited herein are incorporated herein by ref&rence and 
indicated by a number corresponding to the numbered location in the list of references 
cited at the conclusion of the ''Detailed Description of the Invention", below). Many 

2S polyketides possess antibiotic, anticancer, immunosuppressive or other useful biological 
properties. In recent years, researchers have developed a number of protein engineering 
strategies to manipulate polyketide synthases (PKSs), the enzyme complexes responsible 
for biosynthesis of polyketides, in order to generate novel chemical structures {3, 4). 
. PKSs assemble the carbon backbopes of polyketides through repeated condensations 

30 between acyl-CoA thioesters, typically malonyl, methyhnalonyl and ethylmalonyl. Thus, 
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the diversity observed in polyketide structures is in part derived from the incorporation of 
different starter or extender monomers, v^hich is controlled by the PKS. 

[0005] In modular PKS systems, such as the 6-deoxyerythronoUde B synthase 
(DEBS), selection of the extender unit for each module, as well as the starter unit in many 
5 cases, is carried out by an acyltransferase (AT) domain. An AT domain catalyzes the 
transacylation of the monomer unit from CoA to the phosphopantethiene arm of the acyl 
carrier protein (AC?) in the same module. AT domains generally possess a stringent 
specificity for a single acyl-CoA substrate in their natural context, although some ATs 
can incorporate at least two different monomers with similar efficiency. Alignments 

1 0 between methylmalonyl-CoA (mmCoA) specific and malonyl-CoA (mCoA) specific AT 
domain sequences always cluster into two groups according to the specificity of the 
domain (5-7). Haydock et al. have identified two possible divergent sequence motifs 
based on such alignments and comparison to the Escherichia coli malonyl-CoA: ACP 
transacylase (FabD) crystal structure {8, 9) but have not demonstrated the contribution of 

1 5 these motife to substrate specificity experimentally. In a separate study, Lau et aL used in 
VIVO experiments to locate a different variable region at the C-terminal segment of AT 
domains that may influence specificity {10), Because this segment of the AT shows little 
homology between different AT domains, it was not clear how these amino acids 
contributed mechanistically to substrate selectivity. 

20 [0006] Despite the elucidation of potential regions within ATs for engineering 

specificity, the most popular method of changing substrate utilization in PKSs has been 
via cassette mutagenesis of AT domains in which the entire -300-350 amino acid domain 
of one module is exchanged vsdth a homologous AT cassette that encodes a different 
starter or extender unit fi-om another, usually heterologous, PKS module. This technique 

25 has proven to be generally robust. A number of successful AT replacements with modular 
PKSs have been reported, in each case producing a new polyketide with the predicted 
change {11-16), However, occasionally AT replacements are only marginally successful 
or entirely unsuccessful, leading to only very small amounts of the desired compound or 
no product at all {14). For example, an AT cassette firom module 2 of the rapamycin PKS 

30 was used successfully to alter the extender unit specificity from methylmalonate to 
malonate in modules 1, 2, 3, 5, and 6 of DEBS (7(5). However, this same AT cassette 
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failed to produce a polyketide at detectable levels under the conditions tested when it was 
introduced in module 4 of DEBS. 

[0007] Assuming that the genetic engineering of the PKS is properly 
executed, there are two possible reasons for such low production. Either the foreign AT 
5 domain causes the PKS complex to fold incorrectly and lose, in whole or in part, a 

necessary activity, or the replacement leads to a modified polyketide chain that is not or 
only poorly recognized as a substrate by a subsequent ("downstream**) PKS activity. In 
the case of the former, it might be possible to overcome this liability if the AT domain 
specificity was engineered in a manner that minimized perturbation to the tertiary 

10 structure of the module - e.g. mutagenesis to a limited nxmiber of amino acids. To do so, 
however, requires an understanding of the rules for proper PKS folding and the structural 
features involved in substrate recognition. Thus, there remains a need for methods to alter 
the domain specificity of an AT domain of a modular PKS such that the resulting PKS 
can produce a polyketide, and there remains a need for polyketides produced by the 

15 method, including but not limited to the polyketide 6-desmethyl-6-deoxyerytlironolide B 
and the erythromycin and motilide compounds corresponding thereto. 

SUMMARY OF THE INVENTION 

[0008] The present invention provides a method for the altering the AT 
20 specificity of a module of a PKS by changing a limited number of the amino acid residues 
in the domain. In one embodiment, site-specific mutagenesis is used to make the changes 
in the coding sequence of a nucleic acid that encodes the AT domain. In another 
embodiment, the AT domain is from extender module 4 of DEBS. 

[0009] The present invention also^rovides recombinant PKS enzymes 
25 comprising one or more AT domains altered by the method^ as well as vectors encoding 
such enzymes, and host cells comprising such vectors. 

[001 0] The present invention also provides polyketides produced by the host 
cells of the invention. In one embodiment, the polyketide is 6-desmethyl-6- 
deoxyerythronolide B, a glycosylated or hydroxylated derivative thereof, and 
30 corresponding erythromycins thereof. 
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[001 1 1 In another embodiment the erythromycins corresponding to 6- 
desmethyl-6-deoxyerythronolide B are produced by providing the compound to a culture 
of Saccharopolyspora erythraea under conditions such that the corresponding 
erythromycins are produced. In another embodiment, the erythromycins corresponding to 
5 6-desmethyI-6-deoxyerythronolide B are produced by expressing the DEBS genes of the 
preset invention in Saccharopolyspora erythraea and culturing the host cell comprising 
those genes under conditions such that the genes are expressed and the corresponding 
erythromycins are produced. 

[0012] These and other embodiments, modes, and aspects of the invention are 
10 described in more detail in the foUov^ing description, the examples, and claims set forth 
below, 

DETAILED DESCRIPTION OF THE INVENTION \ 

[0013] Utilizing the crystal structure of E. coli FabD (P) and amino acid 
15 alignments of different modular PKS AT domains, key residues predicted to contribute to 
the specificity of AT domains were identified, and based on these predictions, three 
independent site-directed mutant ATs in module 4 of DEBS were constructed. Two of the 
regions engineered corresponded to those previously identified by Haydock et aL as 
important for specificity (5), whereas the third did not. All of the mutants permitted the 
20 incorporation of a malonate (as well as a methytaialonate) unit at the corresponding 
position of the polyketide, thereby producing 6-desmethyl-6-deoxyerythronolide B (6- 
desmethyl-6-dEB) for the first time. A series of cassette substitutions in module 4 using 
nialonyl-CoA specific AT domains fi-om four different heterologous sources as well as a 
hybrid module 4, however, did not produce any polyketide. Together, these results 
25 suggest that module 4 of DEBS is particularly sensitive to heterologous domain 

replacement and that the methods of the present invention are useful alternative methods 
for engineering specificity of modular PKSs. 

[0014] All six of the extender AT domains in DEBS are specific for 2(S)- 
methylmalonyl-CoA (27). Five different 6-deoxyerythronolide B (6-dEB) analogs have 
30 been produced by replacing the AT domains from modules 1, 2, 3, 5, or 6 of DEBS with 
the AT domain j&om module 2 of the rapamycin PKS (rapAT2) {16). Production titers of 
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the corresponding 12-, 10-, 8-, 4-, and 2-desmethyl-6dEB analogs ranged from -4-70% of 
the amount of 6-dEB produced by unmodified DEBS. This same cassette was also used to 
replace the AT domain from module 4 of DEBS (AT4) as described in the Examples 
below. Expression of this PKS in 5. lividans did not lead to any detectable levels of the 
5 desired polyketide 6-desmethyl-6-dEB. This result was somewhat surprising, because the 
cassette functioned in every other DEBS module. 

[0015] To determine if the problem resulted from an incompatibility between 
the rapAT2 domain and DEBS module 4, three additional AT swaps were made using 
ATs from the FK520 PKS and the epothilone PKS (EpoPKS). Each of the chosen ATs, 

10 flc520AT10, epoAT3 and epoAT4, posses a greater sequence similarity to DEBS AT4 
than does rapAT2. The first two AT domains incorporate only malonate extender units in 
their native PKSs whereas the epoAT4 domain appears to be capable of incorporating 
either malonate or methylmalonate in epothilone biosynthesis {18). Therefore, a 
fimctional substitution with this domain should result in production of both 6-dEB and 6- 

1 5 desmethyl-6dEB in the presence of both precursors. An alternative set of jimctions 
between the native and heterologous sequences was also used in these replacements to 
reduce the amount of heterologous sequence introduced, while still retaining those 
regions believed to be important for specificity. The new sites were 43 amino acids 
downstream and 60 amino acids upstream of the right and left junctions, respectively, 

20 used in the substitution with the rapAT2 cassette. Again, however, the expected 6- 

desmethyl compound could not be detected when the recombinant PKSs were expressed 
in S. lividans. Furthermore, 6-dEB, the natural product of DEBS, was also absent bom 
the epoAT4 swap. 

[001 6] Another concern for the apparent inability of malonyl-CoA specific 
25 ATs to fiinction in module 4 of DEBS was the specificity of the downstream enzymatic 
domains which reduce the beta-ketone to a methylene after condensation with the 
, extender unit. Module 4 contains ketoreductase (KR), dehydratase (DH) and 
enoylreductase (ER) domains of which one or more could possess severely attenuated 
activity for the alpha-desmethyl pentaketide intermediate. A hybrid module containing 
30 the KS from DEBS module 4 and the AT, DH, ER, KR, and ACP domains from the 
epothilone PKS module 5 was constructed. The EpoPKS module 5 incorporates a 
malonate extender unit and so the reductive domains from this module process a 
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polyketide intermediate with the appropriate alpha-carbon functionality. This PKS also 
did not produce any compound related to 6-dEB that could be detected. 

[0017] Because neither of the above approaches afforded the desired 6- 
desmethyl analog, the structurally more conservative methods of the present invention 
5 were developed and used to generate the compound. The E. coli FabD crystal structure 
was studied to locate regions in or near the active site with amino acid sequences showing 
a strong correlation between malonyl and methylmalonyl specificity in AT alignments. 
Three such regions were identified and targeted for mutagenesis of DEBS AT4; in each 
case, the region was mutated fix)m the canonical mmCoA-specific motif to the canonical 

1 0 mCoA-specific motif using site-directed mutagenesis, as described below. The method 
can be employed in reverse to change a mCoA-specific motif to ammCoA-specific motif. 
Region 1, immediately upstream of the highly conserved Gin residue in the active site 
(Gln-63 in FabD), and region 2, a single amino acid adjacent to the active site serine 
residue (Ser-92 in FabD) have been implicated in specificity of AT domains based on 

15 sequence alignments {8) but have never been directly proven to play a role in specificity. 
Region 3 is adjacent to a highly conserved histidine residue (His-201) that lies near the 
active site serine in three dimensional space. 

[0018] Thesejegions are shown below utilizing the DNA and amino acid 
sequences corresponding to the DEBS module 4 AT. 
20 Region 1 - in DEBS module 4 AT, specific for methybnalonyl CoA: 

6229 6249 
GAG CGC GTC GAC GTG CTC CAG 
E R V D V L Q 

25 Region 1 - in mutated DEBS module 4 AT of the invention, able to bind malonyl and 
methylmalonyl CoA: 

6229 6249 
GAG GAC GTC CTC TAG GCC CAG 
ED V L y A Q 

30 

Mutations to provide the amino acid sequences EDDLYAQ and EDTLYAQ are also 
preferred for Region 1. 
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Region 2 - in DEBS module 4 AT, specific for methylmalonyl CoA: 

6325 6345 
GGC CAC TCG CAG GGC GAG ATC 
5 G H S Q G E I 

Region 2 - in mutated DEBS module 4 AT of the invention, able to bind malonyl and 

methylmalonyl CoA: 

6325 6345 
GGC CAC TCG CTC GGC GAG ATC 
10 G H S L G E I 

Mutations to provide the amino acid sequences GHSVGEI and GHSIGEI are also 

preferred for Region 2. 

Region 3 - in DEBS module 4 AT, specific for methylmalonyl CoA: 

6625 6645 
1 5 GTG CGC TAC GCC TCC CAC TCC 

V R Y A S H S 

Region 3 - in mutated DEBS module 4 AT of the invention, able to bind malonyl and 

methylmalonyl CoA: 

20 6625 6645 
GTG CGC CAC GCC TTC CAC TCC 

V R H A F H S 

Mutation to provide the amino acid sequence VRHGFHS is also preferred for Region 3. 

25 [0019] All three of the mutant PKSs yielded similar product profiles when 

expressed in S. lividans. Based on LC/MS analysis, each PKS produced a combination of 
6-dEB and two additional compounds, each whose mass was 14 amu smaller than 6-dEB. 
One of these displayed the same retention and mass fragmentation as 8,8a- 
deoxyoleandolide (8,8a-d01e). This compound is formed by priming of the DEBS loading 

30 domain with acetate rather than propionate and is usually observed as a minor component 
when DEBS is expressed in heterologous hosts (22). The other compound was presumed 
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to be the desired 6-desmethyl-6dEB analog and was purified for structural 
characterization. *H-NMR and '^C-NMR spectroscopy established definitively that this 
compound was 6-desmethyl-6-dEB. Thus, in each of the PKSs, the specificity of the AT 
domain has been altered to incorporate a malonate extender unit, although not 
5 exclusively. The total yields of polyketides were - 5-1 0 mg/L under the conditions used. 
The ratios of the polyketide products for each PKS were slightly different, with the region 
3 mutations yielding the highest ratio of mCoA to mmCoA incorporation. 

[0020] There have been several examples of mmCoA to mCoA AT 
replacements reported that collectively include every module of DEBS except module 4 

10 {12-14, 16), Stassi et aL were able to make a functional AT swap in module 4 with an 
ethylmalonyl-CoA specific AT which was capable of incorporating either a butyryl unit 
in the presence of ethylmalonyl-CoA or a propionate unit if no ethylmalonyl-CoA was 
available (75). However, a substantial decrease in polyketide titer is believed to have 
accompanied the substitution. As noted above and in the Examples, the substitution of 

15 DEBS AT4 with four different heterologous mCoA AT cassettes did not yield any 

measurable polyketide production under the test conditions employed. The reason for the 
apparent low or non-production fi-om these substitutions is not known. However, the 
subsequent synthesis of 6-desmethyl-6-dEB by the site-specific AT4 mutations in 
reasonable yield demonstrates that DEBS can process the uxmatural pentaketide chain 

20 lacking an alpha-methyl. Together, these resuUs suggest that substitution of AT domains 
in module 4 might alter the conformation of the module so as to disrupt efficient transfer 
of the acyl-CoA substrate to the ACP domain. 

[0021] This hypothesis is best supported by the substitution writh the epdAT4 
cassette, which should not pose any substrate barriers to DEBS, because of its presumed 

25 relaxed mCoA and mmCoA specificity. Furthermore, some previously reported AT 

substitutions have resulted in a ^relaxation* of specificity, which suggests that the rest of 
the module can affect the conformation of a heterologous AT domain (75. 23). This 
hypothesis is also consistent with the observation that alterations to a region at the C- 
terminus of AT domains, and located at a considerable distance from the active site, can 

30 still affect substrate specificity (70), possibly through steric forces that distort the AT 
domain. 
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(0022] The E. coli FabD crystal structure shows five water molecules in a 
plane just above and surrounding the active site serine (Ser-92). Five atoms of an acyl- 
CoA substrate - the thioester carbonyl oxygen and sulfur atoms, the two carboxylate 
oxygen atoms, and the alpha carbon substituent atom - can be positioned in the active site 
5 such that they occupy nearly the same positions as these water molecules. 

[0023] The geometry of substrates in the active site of AT domain can be 
hypothesized to exist as follows. The cleft containing the active site is oriented vertically 
and lies in a plane slightly above the catalytic serine (Ser-92 in £. coli FabD). The rest of 
the CoA (or ACP) moiety rises up out of the active site clefl. This configuration places 
10 the carboxyl group of the substrate in proximity to a conserved arginine residue (Arg-1 17 
in FabD) and is therefore consistent with biochemical experiments that have directly 
imphcated this residue in carboxylate binding {24). In this model, the substituent on the 
alpha-carbon of an acyl-CoA substrate would point toward a hydrophobic pocket with a 
conserved Gta residue (equivalent to Gln-1 1 of E. coli FabD) lying immediately above it. 

1 5 10024] The following table shows an alignment of divergent motifs in PKS 

AT domains. The divergent sequence blocks at or near the active site of PKS AT domains 
that have a correlation with malonyl versus methylmalonyl specificity are shown. The left 
block is about 30 residues toward the N-terminus from the active site Ser in the middle 
block. The right block ends with the invariant His residue (ffis-201 in E, coli FabD) 

20 found in the active site. The residue just following the invariant GHS motif is a branched 
chain amino acid in all known malonyl-specific AT domains or a glutamine in most 
methylmalonyl-specific AT domains. It ig part of a hydrophobic pocket that may be 
involved in recognition of the different malonyl-CoA substrates. Abbreviations are epo, 
epothilone; sor, soraphen; asc, ascomycin; rap, rapamycin; nid, niddamycin. In the few 

25 cases where AT domains of the same specificity fi-om the same cluster had different 
sequences within a block, the consensus sequence is shown. 
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Table 3 

Alignment of Divergent Motifs in PKS AT Domains 
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[0025] In accordance with the methods of the present invention, three 
different motifs v/ithin the DEBS AT 4 domain can be engineered to change the substrate 
selectivity j&om mCoA exclusively to allow both mCoA and mmCoA. Because the 
20 intracellular concentrations of these two substrates are not known in S. lividans^ it is not 
possible to calculate the relative specificities of the mutant AT domains based on the 
polyketide titers. These could be addressed and calculated with in-vitro systems such as 
those recently developed for individual modules of DEBS (25, 2(5). 

[0026] Based on the E, coli FabD structure, all three of these regions lie 
25 within the active site cleft of the AT domain. Prior to the present invention, the divergent 
motifs of regions 1 and 2 could be used to predict the specificity of a mCoA or mmCoA 
AT domain, but it was not known if these motifs were directly involved in substrate 
selectivity or were only an indication of specificity resulting fi'om evolutionary [ 
constraints. Furthermore, the methods of the invention can be used to target a thnnd 
30 isolated region of AT domain generally. This third region has an even greater influence 
on specificity than the other two in the DEBS AT4 domain. 

[0027] The consensus motifs for regions 1 and 2 are the same for both 
mmCoA ATs as well as ethylmalonyl (em) AT domains; therefore, how these amino 
acids would contribute directly to binding of the two different substrates is not apparent. 
35 However, the consensus motifs for region 3 are different for mCoA (where the motif is 
His-X-Phe-His, typically His-Ala-Phe-His), mmCoA ATs (Tyr-X-Ser-His or Val-X-Ser- 
His, where X is typically Ala), emCoA ATs (Thr-Ala-Gly-His or Thr-Ala-Ser-His or Cys- 
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Pro-Thr-His), and methoxymalonyl (omem) CoA (Phe-Ala-Gly-His or Val-Ala-Gly-His). 
In the preceding motifs, X is independently selected from any amino acid. In particular 
embodiments, the mutations introduced in region 3 to provide an omemCoA specific AT 
domain are selected based on the sequence of the starting domain. Thus, if the AT domain 
to be mutated is a malonyl CoA specific domain having a region 3 motif HAFH, that 
sequence is changed to HAGH to yield an omemCoA specific domain. If the AT domain 
to be mutated is a methylmalonyl CoA specific domain having a region 3 motif YASH, 
tfiat sequence is changed to FASH to yield an omemCoA specific domain. If the AT 
domain to be mutated is an ethylmalonyl CoA specific domain having a region 3 motif 
TAGH, that sequence is changed to FAGH to yield an omemCoA specific domain. 

[0028] The methods of the invention thus allow one to change the specificity 
of one type of AT to another by targeting region 3 for mutagenesis as exemplified herein. 
The size differences in side chains between the Phe, Ser, or Gly residues immediately 
adjacent to the conserved His residue (His-201) could possibly account for the diffi^^ces 
in the corresponding acyl side chains of mCoA, mmCoA, emCoA, and omemCoA 
substrates. Regardless, the methods of the present invention allow alterations of the amino 
acid sequences in region 3 to change one motif to another and correspondingly to change 
the specificity of the AT domain so changed. 

{0029] It is surprising that all three regions can be altered individually to 
change the specificity of the domain. This result indicates that many different amino acids 
can contribute to substrate binding, perhaps either directly or indirectly. Therefore, it is 
likely that the degree to which the mutations described here will affect AT specificity will 
vary among different AT domains and PKSs. Alteration of two or all three of these 
regions could improve productivity and specificity and is within the scope of the present 
invention. High throughput screening techniques of isolated AT domains or modules 
could be used to find mutants with even better properties than those described here. 
Furthermore, the identification of these critical residues for substrate utilization could 
facilitate the tailoring of AT specificity for non-natural extender units and used in 
conjunction with recent precursor engineering techniques (27) to enhance the diversity of 
ketide units that can be used in combinatorial biosynthesis technologies. 
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[0030] Other features and advantages of the invention will be apparent from 
the following detailed description, and fix>m the claims. 

{003 1 ] The following references are referred to herein by the number shown 
to the left of each reference citation; each of the references cited is incorporated herein by 
5 reference. 
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[0032] The present invention is further described by the following examples. 
The examples are provided solely to illustrate the invention by reference to specific 
20 embodiments. These exemplifications, while illustrating certain specific aspects of the 
invention, do not portray the limitations or ciFCim:iscribe the scope of the disclosed 
invention. 



EXAMPLES 

25 Example 1 Strains, culture conditions, and DNA manipulation. 

[0033] All strains, culture conditions and DNA manipulations were essentially 
as described (77) unless noted otherwise. In the construction of the final S. lividans strains 
used for expression of pol>^etides, a second plasmid, pBOOST, was included. This 
plasmid, in conjunction with the pRM5-based expression plasmids used* here increases 
30 polyketide production by a mechanism that is currently under investigation (see U.S. 
patent application Serial Nos. 10/126,196, filed 19 April 02, and 60/259,289, filed 28 

Dec. 00, both of which are incorporated herein by reference). 

! 

Example 2 Construction ofAT replacements in DEBS Module 4. 

35 [0034] Four different AT cassettes were used for substitution of the AT 

domain in module 4. The rapAT2 cassette has been described (IS). The remaining three 
cassettes - FK520AT10, epoAT3, and epoAT4 - were PGR amplified fcom DNA 
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encoding the FK520 (7) and epothilone PKSs {18, 19) using oligonucleotide primers. For 
replacement of the rapAT2 cassette, BamHl and Pst I restriction sites were engineered in 
positions flanking the AT4 domain which correspond to the same positions used in other 
modules of DEBS with the same cassette through a series of subcloning and PGR steps 
5 using standard techniques. 

[0035] To replace DEBS AT4 with the other three cassettes (as well as the 
AT4 site-specific mutants discussed below), a different set of restriction sites was 
engineered within the AT4 domain. A subclone was generated in which unique Hind HI 
and Pst I sites were introduced in AT4 using PGR amplification of segments --Ikb 
10 upstream of the Hind III site, --Ikb downstream of the Pst I site and between the two sites. 
The three PGR products were assembled to give pK0S13 1-62A, which was verified by 
DNA sequencing. Both restriction sites are silent^ and the altered sequences are as follows 
(restriction sites underlined, altered nucleotides in bold): 
5*- 

15 CTCTCGCCGCACACCGACTG GAAGCTT CTGGACGTCGTCCGCGGCGACGGC: 
5^ 

CTGCTGATGGGGGTCGAGGAGACTGCAGAGCACGCGGGCGCGGAAGTGACC. 

[0036] The final expression plasmids were constructed as follows. The vector 
pKOSl 1-77 contains the DEBS genes with an engineered Spe I site downstream of the 

20 AGP of module 2 (20). This site appears to be functionally silent based on production of 
6-dEB by pKOSl 1-77 in 5. lividans (16). The Spe I to Bgl U fragment containing the 
gene for DEBS2 was subcloned from pKOSl 1-77 into Litmus 28 (New England Biolabs) 
and the Hind IE md Pst I sites were removed from the polylinker by cutting with Hind TR 
and Xba I and inserting a linker to give pKOS 1 3 1 -68. The BsiW I to BamH I fragment 

25 was subcloned from pKOSl 1-77 into Litmus 28 to give pKOS131-57. The Apa I to Not 1 
fragment of pKOS131-62A was moved into pKOS131-57 to give pKOS131-72B and the 
BsiW I to Fse I fragment from pKOS131-72B was moved into pKOS13 1-68 to give 
pKOS131-72A. Each of the three heterologous AT domains (FK520AT10, epoAT3 and 
epoAT4) were moved as Hind IHrPst I fragments into pKOS131-72A. At this stage the 

30 AT inserts of each subclone were confirmed by DNA sequencing. The Spe I to Bgl E 

fragment from each of the three subclones was moved into pKOSl 1-77 to obtain the final 
PKS expression vectors. 



14 



wo 03/014312 



PCT/US02/25094 



Example 3 Mutagenesis of DEBS A T4. 

5 [0037] Site-directed mutations were introduced in three regions of DEBS AT4 

using the Altered-Sites mutagenesis kit (Promega). The Hind 131-Pst I PGR product from 
AT4 was cloned into pAlter^ and the mutations described^for regions 1 » 2, and 3 above 
were introduced following protocols provided by the manufacturer. All mutations were 
verified by DNA sequencing. Each of the mutant cassettes was introduced into the Hind 
10 in and PstI sites of pKOS131-72A. Final expression plasmids were made analogous to 
the above AT substitutions. 

Example 4 Production arid Analysis of Polyketides. 

[0038] Each of the expression plasmids was introduced in S. lividans K4-1 14 
15 also containing pBOOST. Supematants of strains grown in R5 liquid (+ 50mg/L 

thiostrepton and 200 mg/L apramycin) were analyzed using an LC-MS system equipped 
with on-line extraction. The LC-MS system was comprised of a 10 port, 2 position 
switching valve/injector» Beckman System Gold HPLC, an AUtech ELSD detector, and a 
PE SCIEX APIlOO LC MS-based detector equipped with an atmospheric pressure 
20 chemical ionization source. For 6*dEB and 6-desmethyl-6-DEB, 100 microliters of 
clarified whole broth was loaded onto the guard column after a 1 min pre-equilibration 
with H2O at 1 ml/min. At 30 sec post-injection, a linear gradient to 1 S% MeCN ov^ 1 
min was initiated. At 2 niin the direction of flow through the guard column was reversed, 
^ and the eluent was diverted onto a Metachein Inertsil ODS-3 column (5 Dm, 4.6 X 150 
25 mm) pre-equilibrated with 15% MeCN. A linear gradient from 15 to 100% MeCN at 1 
ml/min over 6 min, then 100% MeCN for 3 min was monitored by ELSD and MS. 
Authentic purified 6-dEB and 8,8a-d01e were used as a controls for compound 
verification. 6-dEB was used to generate a calibration curve for polyketide titer measured 
by ELSD. 

30 

Example 5 Purification and characterization of 6-desmethyl'6'dEB. 

15 
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(0039J A host cell of the invention that produces 6-desmethyl-6-dEB, S. 
lividans K4-1 14/pKOS164-35G + pBOOST, was grown in 2L of R5 as above. The cells 
were removed by centrifugation, and the broth was filtered through a 2.S x 30 cm column 
of XAD-1 6 adsorbent resin. The resin was washed with two volumes of water and then 
5 eluted with four volumes of acetone. Hie acetone eluate was evaporated on a rotary 
evq)orator to a dark aqueous slurry, which was extracted three times with equal volumes 
of ethyl acetate. The organic extracts were combined, washed sequentially with saturated 
aqueous NaHCO} and brine, then dried over MgS04, filtered, and evaporated. The oily 

c 

yellow residue was dissolved in 2 ml of CH3CN, diluted with 2 ml of water, and filtered 

10 through a C18 Mega Bond-Elut (Varian) extraction column. The column was washed with 
10 ml of 1:1 CH3CN/H2O, and the total eluate was evaporated. Final purification was by 
preparative HPLC, using a 20 x 50 mm Cis InertSil (Metachem) column and a gradient 
&om 20% to 100% CH3CN in water at a flow rate of 10 ml/min. Detection was by UV 
absorbance at 210 nm, with peak identity confirmed by mass spectrometric analysis. The 

1 5 appropriate HPLC firaction was evaporated, yielding 2 mg of 6-desmethyl-6-d£B. 'H- 
NMR (CDCI3, 400 MHz): 8 5.154 (IH, ddd, J = 1 .2, 4.0, 9.6 Hz, H-13); 4.038 (IH, br d, J 
= 10.4 Hz, H-5); 3.984 (IH, br d, J = 10.8 Hz, H-3); 3.779 (IH, dd, J = 0.8, 4.8 Hz, OH- 
1 1); 3.710 (IH, ddd, J = 2.4. 4.8, 10.4 Hz, H-1 1); 3.021 (IH, d, J = 2.4 Hz, OH-3); 2.803 
(IH, dq, J = 6.8, 10.8 Hz, H-2); 2.698 (IH, br q, J = 6.8 Hz, H-10); 2.53 (IH, m, H-8); 

20 2.186 (lH,br s, OH-5); 1.82 (unres., H-14a); 1.80 (unres., H-6a); 1.80 (uraes., H-4); 1.74 
(unres., H-7a); 1.73 (unres., H-1 2); 1.65 (IH, m. H-6b); 1.53 (IH, m, H-14b); 1.318 (3H, 
d, J = 6.8 Hz, Me-2); 1 .260 (IH, m, H-7b); 1 .051 (3H, d, J = 6.4 Hz, Me-8); 1.022 (3H, d, 
J = 6.8 Hz, Me-10); 1.01 1 (3H, d, J = 6.8 Hz, Me-4); 0.937 (3H, t, J = 7.4 Hz, H-1 5); 
0.882 (3H, d, J = 6.8 Hz, Me-12). "C-NMR (CDCI3, 100 MHz): 8 213.02 (C-9), 178.52 

25 (C-1), 78.86 (C-3), 76.10 (C-13), 74.21 (C-5), 71.01 (C-1 1), 44.63 (C-2). 44.07 (C-10), ^ 
40.71 (C-12), 38.73 (C-8), 37.01 (C-4), 29.76 (C-6), 28.58 (C-7), 25.46 (C-14), 14.80 
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(Me-2), 12.28 {Me-8), 10.63 (C-15), 9.22 (Me-12), 6.05 (Me-10), 3.28 (Me-4). 
Unresolved 'H resonances were assigned from two-dimensional correlation experiments. 

[0040] Numerous modifications may be made to the foregoing systems 
without departing fi'om the. basic teachings thereof. Although the present invention has 
been described in substantial detail with reference to one or more specific embodiments, 
those of skill in the art will recognize that changes may be made to the embodiments 
specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention^ as set forth in the claims which follow. All 
publications or patent documents cited in this specification are incorporated herein by 
reference as if each such publication or document was specifically and individually 
indicated to be incorporated herein by reference. 

[0041] Citation of the above publications or documents is not intended as an 
admission that any of the foregoing is pertinent prior art, nor does it constitute any 
admission as to the contents or date of these publications or documents. 
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CLAIMS 

What is claimed is: 

1 . A method for altering substrate specificity of an acyltransferase domain of 
a modular polyketide synthase, said method comprising changing one or more amino acid 
residues in one or more regions of said domain, said regions selected from one or more of 
the group consisting of (i) region 1, inmiediately upstream of a highly conserved Gin 
residue in an active site; (ii) region 2, a single amino acid adjacent to an active site serine 
residue; and (iii) region 3, adjacent to a highly conserved histidine residue that lies near 
the active site serine in three dimensional space. 

2. The method of claim 1, wherein said substrate specificity is changed from 
methylmalonyl to malonyl. 

3. A polyketide that is 6-Desmethyl-6-deoxyerythronolide B. 

4. A hydroxylated and/or glycosylated derivative of the compound of Claim 
3. 

5. The derivative of Claim 4 that is 6-desmethyl-erythromycin A. 

6. The derivative of Claim 4 that is 6-desmethyl-erythromycin B. 

7. The derivative of Claim 4 that is 6-desmethyl-erythromycin C. 

8. The derivative of Claim 4 that is 6-desmethyl-erythromycin 

9. A host cell that comprises a polyketide synthase (PKS), said PKS 
comprising 

an acyltransferase (AT) domain produced from a coding sequence that has been generated 
by site-specific mutagenesis of a naturally occurring coding sequence to change one or 
more amino acid residues in one or more regions of said domain, said regions selected 
from one or more of the group consisting of (i) region 1, immediately upstream of a 
highly conserved Gin residue in an active site; (ii) region 2, a single amino acid adjacent 
to an active site serine residue; and (iii) region 3, adjacent to a highly conserved histidine 
residue that lies near the active site serine in three dimensional space. 

10. The method of Claim 2, wherein said region is region 1, and an amino acid 
sequence ERVDVLQ is changed to EDVLYAQ, EDDLYAQ, or EDTLYAQ. 

1 1 . The method of Claim 2, wherein said region is region 2, and an amino acid 
sequence GHSQGEI, is changed to GHSLGEI, GHSVGEI, or GHSIGEL 
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12. The method of Claim 2, wherein said region is region 3, and an amino acid 
sequence VRYASHS is changed to VRHAFHS, or VRHGFHS. 

13. The method of Claim 1, wherein said region is region 3, and an amino acid 
sequence His-X-Phe-His is changed to Tyr-X-Ser-His, Tyr-Ala-Ser-His, or Tyr-Gly-Ser- 

5 His. 

14. The method of Claim 1, wherein said region is region 3, and an amino acid 
sequence His-X-Phe-His is changed to Thr-Ala-Gly-His or Cys-Pro-Thr-His. 

15. The method of Claim 1, wherein said region is region 3, and an amino acid 
sequence Tyr-X-Ser-His is changed to His-X-Phe-His, His-Ala-Phe-His, or His-Gly-Phe- 

10 His. 

16. The method of Claim 1, wherein said region is region 3, and an amino acid 
sequence Tyr-X-Ser-His is changed to Thr-Ala-Gly-His or Cys-Pro-Thr-His. 

17. The method of Claim 1, wherein said region is region 3, and an amino acid 
sequence selected from the group consisting of His-X-Phe-His, His-Ala-Phe-His, Tyr-X- 

1 5 Ser-His, and Val-X-Ser-His, is changed to an amino acid sequence selected from the 
group consisting of Thr-Ala-Gly-His, Thr-Ala-Ser-His, Cys-Pro-Thr-His, Phe-Ala-Gly- 
His, and Val-Ala-GIy-His. 

18. The method of Claim 1, wherein said region is region 3, and an amino acid 
sequence selected from the group consisting of Thr-Ala-Gly-His, Thr-Ala-Ser-His, Cys- 

20 Pro-Thr-His, Phe-Ala-Gly-His, and Val-Ala-Gly-His, is changed to an amino acid 

sequence selected from the group consisting of His-X-Phe-His, His-Ala-Phe-His, Tyr-X- 
Ser-His, and Val-X-Ser-His. 

19. A Streptomyces host cell that produces the compound of Claim 3 . 

20. A Saccharopolyspora erythraea host cell that produces the compound of 
25 Claims. 
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